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Abstract 

We consider a type of long-range percolation problem on the positive integers, 
motivated by earlier work of others on the appearance of (in)finite words within a 
site percolation model. The main issue is whether a given infinite binary word ap- 
pears within an iid Bernoulli sequence at locations that satisfy certain constraints. 
We settle the issue in some cases, and provide partial results in others. 



1 Introduction 

Let W = {wi, W2, . . . ) G {0, 1}^ be an infinite binary word, and X = (Xi, X2, . . . ) and 
Y = {Yi,Y2, . . .) be independent Bernoulli sequences (i.e., iid sequences of Bernoulli 
random variables) with parameters px = P{Xi = 1),Py = P(Xj = 1) ^ (0, 1). Let M 
be a positive integer. An admissible (M- jembedding of 1^ in F is a sequence {rrii : i > 1) 
of integers such that Ym, = Wi and 1 < rrii — rrii^i < M for each i > 1. (By default, we 
take mo = 0.) We say that W is M-seen in Y if there exists an M-admissible embedding 
of W in Y. In this paper, we ask whether or not the events {W is M-seen in Y} and 
{X is M-seen in Y} can have strictly positive probability. 

This question is motivated by analogous questions considered in [21 [6] concerning 
percolation of words on graphs such as Z'^. These questions were partially answered for 
large c? in |2j, and on a modified version of in [S]. A version of the above question has 
been answered in the affirmative in [7] for large M and d = 2. Our problem may be set 
in the context of long-range percolation, through a consideration of the oriented graph 
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with vertices {1,2,...} in which there is an edge from itojifl<j— i< M. In this 
setting, our problem corresponds to ordinary site percolation when W is the constant 
word Wi = 1, and to so-called AB site percolation when W is the alternating word with 
W2i = 0,W2i-i = 1 (or vice versa). 

A further formulation of the problem resembles the famous problem of the clairvoyant 
demon posed by Peter Winkler. As above, let X and Y be independent Bernoulli 
sequences with parameters Px,Py'i for simplicity we assume = = |- We color the 
point (^, j) of the first orthant of the square lattice red if Xi = Yj. Let the origin 
(0, 0) be red also. For M > 1, we define an M-admissible path to be an infinite sequence 
m = (mo, mi, m2, . . . ) satisfying mo = and 1 < m^+i — rrii < M for all i, such that 
every point {i,mi), i > 0, is red. There exists an M-admissible path if and only if X is 
M-seen in Y. For references to the clairvoyant demon problem, and for solutions to the 
related problem in which the admissible paths for that problem are permitted to move 
upwards or downwards at each stage, see [HE]- 

Since Y contains arbitrarily long sequences of O's and arbitrarily long sequences of 
I's, it is easy to see that P{W is M-seen in Y) =0 when W is periodic. However, 
the situation for general words is not so clear. In Section 5, we will show that the 
truth of the statements "for every M, P{X is M-seen in Y) = 0" and "for every M, 
P{W is M-seen in Y) = 0" (for an arbitrary infinite word W) is independent of the 
parameters Px,Py of the Bernoulli sequences X,Y; see Theorem [T31 Therefore, except 
in that section, we will assume that 

Px=Py = i 

In order to gain some insight into our problem, we consider the probability of M- 
seeing finite words W, and particularly how this probability depends on W. Let 



It is easy to check (as we will do in Section 3) that the probability of M-seeing a given 
word of length n is minimized by the constant word W = (1, 1, . . . , 1) of length n, 
and that in this case, this probability equals a". In the other direction, we consider 
the alternating word An = (1,0,1,0,...) of length n; we could equally consider the 
alternating word beginning with 0. Since the infinite alternating word is periodic, the 
probability 

Vn = PiAn is M-seen in Y) (1) 

tends to zero as n — > oo. In Section 2, we will show how to compute Vn exactly, and 
hence determine the exponential rate at which this probability tends to zero. If M = 2, 
for example, f„ ~ c(0.85 ...)"■. We will prove that the alternating word is most likely 
to be seen in two cases: 

Theorem 1. (a) Let M = 2 and n > 1. For any word W of length n, 

P{W is M-seen in Y) < Vn- 
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(b) Let M >2, and let Wp^g be the word (1,1,1, ...,0,0,0) comprising p I's followed by 
q 's. Then 

P{Wp^q is M-seen in Y) < Vp+g. 

The first part of this theorem will be proved in Section 2, and the second in Section 4. 
As a consequence of Theorem 1(a), we have the following solution to our main problem 
in case M = 2: 

Corollary 2. If M = 2, P{W is M-seen in y) = for every infinite word W, and thus 
P{X is M-seen in Y) = 0. 

A useful tool in our analysis is the following sequence of 'spacing' random variables. 
Given a finite or infinite word W, define Tq — and, recursively, 

Tk+i = mm{i > Tk : Yi = Wk+i}, Tk+i = Tk+i - T^. 

Note that, while the values of ti,T2, . . . depend on the choice of VF, for any W they 
are iid random variables with the geometric distribution with parameter |. The values 
of Ti, T2, . . . , r„ do not in general determine whether or not the word (wi, W2, • • • , w„) is 
M-seen. (An example illustrating this is given in Section 3.) However, they do so for 
the constant and alternating words. 

Theorem 3. (a) The constant word of length n is M-seen in Y if and only if Tk < M 
for all 1 < k < n. 

(b) The alternating word An of length n is M-seen in Y if and only if 

Tk < kM for alll<k<n and Tk - Tj < {k - j + 1)M for all < j < k < n. 
This theorem will be proved in Section 3. 

Let be a word of length n, and let Nn = Nn{W) be the number of M-admissible 
embeddings of in F. It is easy to see that 

E(Nn) = (M/2)" 

The second moment of Nn can be expressed in the following way. Let J = ( Jq. Ji, J25 • • • ) 
and K — {Kq, Ki, K2, . . .) be independent random walks on Z starting at Jq = Kq ~ 
with, as step-size distribution, the uniform distribution on the finite set {1,2,..., M}. 

Theorem 4. (a) For any word W of length n, and any M > 1, 

E{N^) ^ EiNnfE i H 2-l{wr^Ws)\, (2) 

\{r,sy.Jr=Ks J 

where the product is over r, s e {1, 2, . . . , n}, and 1{A) denotes the indicator function of 
A. 
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(b) Let X be the random word of length n, comprising random letters with the Bernoulli 
(|) distribution. Then 

E{N^{Xf) = E{N^{X)fE{2'-), 

where Zn is the number of visits to zero between times 1 and n made by the random walk 
J -K. 

This theorem will be proved in Section 6. We will also see there that E{2^") is 
asymptotic to a constant multiple of for some cm > 1- In the case M = 2, we 
have C2 = |. By part (a) of this theorem, the constant word maximizes the variance of 

Nn{W). 

2 Recursions for Vn 

Let An = (1,0,1,0,...) be the alternating word of length n starting with 1. (By 
symmetry, probabilities for alternating words starting with are the same as for A„.) 
In this section, we first compute the f„ given in ([T]), and then we prove Theorem 1(a). 
The computation of f„ is done recursively; the recursions satisfied by Vn will be used in 
Section 4 in the proof of Theorem 1(b). 

If an admissible embedding of An in Y exists, we define the standard embedding to 
be that whose sequence of positions [rrii : i > 1) is earliest in the usual lexicographic 
order. We will use the following notation: 

• Vn,k '■= the probabihty that An possesses an admissible embedding, and its stan- 
dard embedding starts at position k. 

• Vn '■= Ylik=i '^ri,k = the probability that An possesses an admissible embedding. 

• V'n:= Vn,M- 

Proposition 5. The sequences Vn and v'^ satisfy the following recursions: 

Vn = aVn-1 + {a - Ml3)v'^_^ and v'n = /Svn-i + (M - l)/3Vn_^ (3) 
for n > 1, with initial conditions Vq = 1 and v'q = 0, and 

Vn+i = {a + {M- l)P)vn - P{M - 2a)vn^i (4) 
for n > 1, with initial conditions f o = 1 and V\ = a. 

Note that in ([3]), unlike (jl]), all the coefficients are nonnegative. This will enable us 
to compare solutions to recursive inequalities. 

It is easy to solve the recursion (jl]) explicitly. The characteristic polynomial is 

/(A) = - (a + (M - 1)/5)A + /?(M - 2a). 
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Note that, for M > 2, 

/(O) = (3{M - 2a) > 0, f{Ml3) = 2(3{Ml3 - a) < 0, 
f{a)=(3{Mf3-a)<0, /(I) = 2/5^ > 0. 

Therefore, the two roots of / he in the disjoint intervals (0, M/9) and (a, 1), respectively. 
This implies that — exponentially fast as n — *■ cxo, and is a quantification of the 
observation that Y contains no admissible embedding of An if there appears, sufficiently 
early in Y, a consecutive subsequence of M + 1 letters all of which are (respectively 
1). A word of caution: while Vn tends to zero exponentially rapidly, the convergence can 
still in a sense be quite slow. For example, if M = 5, the larger root of / is .9978 .... 

Proof. A Bernoulli sequence Y contributing to Vn,k niust satisfy 1^ = 1, and the preced- 
ing sequence (Yi, . . . cannot contain 1 and in that order, since if it did, there 
would be an admissible embedding starting before position k. Therefore, (Yi, . . . , Yfc-i) 
must be of the form (0, . . . , 0, 1, . . . , 1). We distinguish two cases for the starting se- 
quence: all O's (Case 1) and at least one 1 (Case 2). In Case 1, the only condition 
on (Yfc+i, . . . ) is that it must contain an admissible embedding of the remainder of An- 
In Case 2, (Y^+i, . . . ) must contain an admissible embedding of the remainder whose 
standard embedding starts at (relative) position M. (If it started earlier, then the 1 at 
position k — 1 would initiate an earlier embedding.) This yields the recursion 

1 k — 1 

Vn,k = -^Vn-i + —^Vn-iM for 1 < /c < M, n > 1, (5) 

with initial condition ^ = for 1 <k < M. 

The first relation in ([3]) is obtained by summing ([5]) over /c, and the second by setting 
k = M. Finally, one can eliminate v'^ from ([3]) to obtain (jlj). □ 

Proof of Theorem 1(a). Let M = 2 and let W G {0, l}*^ be a word of length n. For 
1 < m < ra, let Wm be the word comprising the last m digits of W . We will use notation 
similar to that at the beginning of this section. 

• Wm,k '■= the probability that possesses an admissible embedding, and its stan- 
dard embedding starts at position k e {1, 2}. 

• Wm '■= u)m,i + Wm,2 = the probability that Wm has an admissible embedding. 

We follow the same procedure as we did for the alternating word Unlike that case, 
we shall obtain only a recursive estimate from above. 

Denote the first digit of Wm by a e {0, 1}, and let 6 = 1 — a be the complementary 
digit. A Bernoulli sequence Y contributing to Wm,i must contain a at position 1 and 
the word Wm-i following. This gives 

Wm,l = |Wm-l- (6) 



5 



If Y contributes to Wm,2, there are two cases. If Y starts with a, then the second digit 
must also be a, and subsequently the Y must contain Wm-i, but not starting at the 
next digit. (Otherwise there would be a standard embedding starting at position 1.) If 
Y starts with b, then the second digit has to be a, and subsequently Y must contain 
Wm-i- This gives 

Wm,2 < jWm-1,2 + jW^-l- (7) 

Note that equality need not hold in (jTj). Suppose that both Y and Wm begin with 
the letters aa, and that (I3, F4, . . . ) contains an admissible embedding of both Wm-2 
and Wm-i- In this case, Y does not contribute to the left side of ([7]) but it does to the 
first term on the right side. 

From ([n])-©, we deduce the recursive inequalities 

Wra < \Wyn-l + \w'rn-l and w'^ < \Wm~l + whcrC Wi = \,w[ = \. 

By comparison with the recursion formula ([3]) with M = 2, 

Vm = jv-m-i + jv'^-i and v'^ = \vm~\ + \v'm-\. whcre vx = % v'^ = \, (8) 

we obtain by induction on m and the positivity of the coefficients in ([8]) that Wm < Vm 
for 1 < m < ra. In particular, Wn < f„ as claimed. □ 

Remark. The above method does not work for M > 3, since in this case the coefficients 
of the recursive inequalities for U7„, w'^ do not match the coefficients of the recursion for 

3 Relations to the spacing random variables 

In this section, we prove several results relating M-seen finite words to inequalities 
satisfied by the spacing variables and their partial sums. When the word is either 
constant or alternating, these are equivalences, and were stated as Theorem [3] in the 
Introduction. For general words, we only have one direction — if the word is seen, then 
the spacing variables satisfy certain inequalities. We will say that W = {wi,W2, ■ ■ ■) is 
seen at (mi, m2, . . . ) if mi < m2 < ■ ■ ■ and 1^. = for each i. 

First, we give an example to show that seeing a word W of length n is not in general 
determined by the values of ri, . . . , r„. Suppose n = 4, M = 2, W = {1,1, 0, 0), and the 
Bernoulh sequence starts with 110110 ■ ■ ■ . Then ti = 1, T2 = 1, = 1, T4 = 3. If W is 
to be seen, then it must be seen at locations mi = 2, 1712 = 4, = 6, and = 7 or 
8. Thus, it is seen if and only if one of the next two digits in the Bernoulli sequence is 
a 0, but this cannot be determined from the first four r^. 

Proposition 6. Let W = {wi, . . . ,Wn)- If W is seen at (mi, . . . ,mn), then Tk < rrik 
for 1 < k < n. In particular, if W is M-seen, then Tk < kM for all 1 < k < n. 
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Proof. Let mo := Tq = 0. We will prove Tk < rrik by induction on k. For the induction 
step we assume < m^.. Let a = w^+i- By definition T^+i is the first location of an 
a after location T^, and mk+i is some location of an a after location > T^, which 
immediately implies T^+i < mk+i- Finally, if W is M-seen, then 



The next result implies that the probability of M-seeing a word of length n is mini- 
mized by the constant word, and in that case, this probability is a^. 

Proposition 7. (a) If W is a word of length n and ti < M , . . . , r„ < M , then W is 
M-seen. 

(b) // a constant word of length n is M-seen, then ti < M , . . . , Tn < M . 

Proof. For part (a), note that if Xj < M for each i < n, then W is M-seen at 
(Ti,...,T„). For part (b), suppose that the constant word (a, a, . . . , a) of length n 
is seen at (mi, . . . ,m„) where 1 < m, — mj_i < M for each i, i.e. up to location m„ 
there is no block of M consecutive non a's. As T„ < m„ by Proposition [6l this implies 
Ti < M for all i <n. □ 

Proposition 8. Let W = {wi, . . . ,Wn)- Suppose that W is seen at (mi, . . . ,m„) and 
that < mj_|_i — rrii < M for each i. If ^ uik+i = ■ ■ ■ = wi for some k + 1 < I and if 
Ti > M, then Ti < m^+i . 

Proof. Let a = wi. In between locations Ti_i and Ti there is a block of at least M 
consecutive non a's. As there have to be a's at locations m^+i < ■ ■ ■ < m; and we have 
mj+i — rrii < M, this block has to be before location m^+i or after location mi, i.e. 
Ti < nik+i or mi < Ti^i. By Proposition [6] we have T/ < mi, so the second alternative 
is not possible. □ 

Proposition 9. Let An be an alternating word of length n. Then An is M-seen if and 



Tk < kM for alll<k<n and Tk-Tj < {k-j + l)M for all < j < k < n. (9) 
Proof. A special property of an alternating word is that 



k 




1=1 



□ 



only if 



Yi = Wk for Tk <i < Tk+i. 



(10) 



Define 



Sk = min{Tfc+i - 1, Sk-i + M}, Sq = 0, (Tk = Sk - Sk 



1- 
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Note that Sk < T^+i and ak < M. Since Tk is strictly increasing in k, we see inductively 
that Sk is strictly increasing in k also. Therefore cjfc > 1 for all > 1. Consider the 
statement 

Tfc < 5fc for all 1 < A; < n. (11) 
We will prove the following implications: 

^ ^ An is M-seen ^ (ED ^ 

First suppose that ( ITTi) holds. Then Tk < Sk < Tk+i, so that Ys,. = Wk by ( ITOi) . 
Since ak < M, it follows that An is M-seen, since it is seen at {Si, . . . , S'„). 

Next assume that An is seen at (mi, . . . , m„) where mi — mi_i < M for all i. The first 
part of (Q follows from Proposition [6l To prove the second part, let < j < A; < n. As 
{Yi : Tj < i < Tk) consists of k — j constant blocks, the interval [Tj,Tk) contains at most 
k — j consecutive elements from mi, . . . , rrin, i.e. mi < Tj < m^+i < m^+r < Tk < mj+^+i 
for some / and r < k — j. So Tk — Tj < mi+^+i — rrii = X]!=I('^i+i ~ ""^i) ^ + l)M. 

Finally, assume that holds. To prove (ITT]) , we will prove the statement 

Tk<Si + {k- i)M (12) 

by induction on i (for fixed k). When i = 0, f|T2l) becomes Tk < kM, which is part of 
assumption When i = k, (IT^ is Tk < Sk, which is the desired conclusion in ( TTTi) . 
For the induction step, suppose (IT^ holds for i with < i < k. To prove it for i + 1, 
we need to check that 

Tk < min{r,+2 - I, S^ + M} + (k - i - 1)M. (13) 

The fact that Tk < Ti+2 -l + {k-i- l)M follows from (P, while Tk<Si + {k- i)M 
is just the induction hypothesis. This proves (|T3|l . □ 

4 Two-block words 

We prove Theorem 1(b) in this section. For p, q,j > 0, define 

ffpj = P(t-i < M, . . . , Tp < M, Tp+j > pM), 
= P(ri < M, . . . , Tp < M, Tp+, < pM) 

and ^ ^ 

Here we have used o^j + a'^ j = a^. Note that Up^ = and -u^g = a^. The next result 
will allow us to compute cTpj fairly explicitly. 
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Lemma 10. Forp,j > and arbitrary I, 

i=0 ^ ^ 



Proof. Use the fact that for any geometric random variable r, the conditional distribu- 
tion of r — M given r > M is the same as the distribution of r, to write 

Pin <M,...,T^<M,Tk> IM) -P{n<M,..., Tm+1 <M,Tk> IM) 
= P(ri < M, . . . , r„ < M, r„+i > M, Tk > IM) 
= pPin <M,...,Tm<M,Tk>il- 1)M) 

for any k > m. Now use induction on m, together with the relation + (.™'^) = 



Lemma 11. Forp,q > 0, P{Wp,q is M-seen) < 



Proof. If Wp^q is seen at (mi, . . . ,mp^q) where m^+i — nii < M for all i, then ti < 
M, . . . ,Tp < M by Proposition [7|(b). Furthermore, if 1 < j < g and Tp+j > M, then 
Tp+j < nip^i < {p + 1)M by Proposition [HI Considering the largest j > I (if any) for 
which Tp+j > M, we see that 

<? 

P(iyp,, is seen) <P{n<M,..., Tp+q < M) + ^ a^j, 

i=i 

where 

= P(ri < M, . . . , Tp < M, Tp+j > M, Tp+j+^ <M,...,Tp+q< M, Tp+j < {p + 1)M). 
Using the same trick as in the proof of the previous lemma we obtain 

ap,, = a'^'^Pin <M,...,Tp<M, Tp+, > M, Tp+, <{p+ 1)M) = a'^-' (3a'^,,, 
so the result follows by definition of Up^q. □ 
For any function /p g, p,q > , define a generalized mixed second derivative by 
A/p,g = fp+1,,+1 - M/5/p,,+i - (a - /?)/p+i,, + /5(M - 2a)fp,q. 

This particular choice of coefficients is designed to correspond to the coefficients in (jlj). 
In order to do so, the middle coefficients would have to sum to —{a + (M — l)/3). This 
particular decomposition was chosen by computing numerically Aup^q for various values 
of the parameters, and checking to see which one made this expression < 0. 

Lemma 12. Aup^q < for p,q > 0. 
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Proof. If fp^q = a^, then 

A/p_g = aP+i - M(3aP -{a- (3)aP+^ + f3{M - 2a)aP = 0. 
Therefore, if we let 



i=i 

we have Aup^g = —^Awp^g, so we need to show that Awp^g > 0. We will do so by 
computing the generating function of this expression as a function of q. 
By Lemma (TUl 



^P'^ = E (^) i-f^r-'PiTp^, > ^M). 

i—n \ / 



i=0 

Since is a sum of m independent geometric random variables and can thus be inter- 
preted as the waiting time for the m-th success in a sequence of Bernoulli experiments, 
and since Sk, the number of successes in the first k of these experiments is binomially 
distributed, we have 



P(T,>A;) = P(^,<,) = ^X:(;)- 

1=0 ^ ^ 



Therefore, since 2 ^ = f3, 



'^p,j 



E (fj(-^r-v E Cf ) - ^'E E (-o-f^) ('f 



i=0 ^ ^ 1=0 ^ ^ i=0 1=0 

It follows that for < x < 1, 

(1 - X) = E E (^) f f ) (14) 

j=l i=0 1=0 ^ ^ ^ ' 

Note that for every polynomial function g of degree deg g < p have 

p 



E(-)^w(-ir^ = o- 



(15) 



It suffices to check this for all polynomial functions of the form g{i) = (^^ , Q < I < p: 

Ef!)(;)Mr-(:)E(!::)(-r-(^)(-ir'-o- 
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Applying (fT5|) to the functions g{i) = ) , < / < p, it follows that the positive part 
at the end of ([HI) is not needed: 



j=l 1=0 1=0 ^ ^ ^ ' 

i=0 

Therefore, 

(1 - x) V W''-^ = 7^ = P^x-P[{l + xY' - If. 

I — ax ^ 1 — ax 

q=\ j = l 

Using this expression, we can write 

{i-x)f2 Aw,,,x'^-' = ^^^^[(1 + xr - irp{x), 

^-^ x^ 1 — ax 

ij=0 ^ ' 

where 

P(x) = (1 + xf^ll - (a - (3)x] -l-{M + (3-a)x + x\M - 2a). 

Note that -P(l) = 2*^(1 — a + /3) — 2 = 0, so we may define a polynomial Q by P(x) = 
(1 — x)Q{x). If Q has nonnegative coefficients, it will follow that Awp^q > for all 
p,q > as required. The coefficients of Q are the partial sums of the coefficients of P. 
The constant and linear terms in P vanish. The coefficient of x^ is (^^) — 2(a — M(3), 
while for k > 3, the coefficient of x'^ is (^^) — (a — /?)(^^^). Therefore, we need to check 
that the following expression is nonnegative for / > 2: 

(f)-.«-M„.g[(:)-,„-„(^.-) 

k — 2 k — / 

This is nonnegative for 2 < I < M since /? Ylt=o (^) ~ ^' ^^"^ ^ ~ ^ since then the 
right side above is 1 — 2/?. □ 

Proof of Theorem 1(h). Let 

5p,q = Vp+q - Up^g, P,q>0. 
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By ([3]) we have Vn+i > afn and thus f„ > a", so 

^pfi = Vp — > and (5o,g = Vg — a'^ > for all g > 0. (16) 

By Olj) and Lemma [12] we have 

- - (a - + /5(M - 2a)5p,g 

= Vp+g+2 - (a + (M - l)/3)t;p+g+i + /3(M - 2a)vp+q - Aup^q > 0, 

which can be rewritten as 

- M(3Sp^g+i >{a- - (3{M - 2a)(5p,,. (17) 

We will now prove by induction on q the statement that (5p+i,g > Mp6p^q for all p > 0. 
By ( IT6l) . for the basis step we have to show that 

t;„4-i-a""*"'>M/3(i;„-a"). 

This follows from 

Vn+i - MPvn = (a - M/5)(i;„ + v'J > (a - M(3)a'', 

where we have used (E]), a — M[3 > 0, > 0, and f„ > a". For the induction step, 
assume that the statement is true for a given g > 0. Using ( fT6|) it follows that 5p^q > 
for that q and all p. Therefore, since < M — 2a < M{a — (3) (which is equivalent to 
M(3 < a), (fTTIl can be written as 

Mp6p,g+i > —j^[6p+i,g - M(36p^g] > 0, 

where the final inequality follows from the induction hypothesis. This proves that 
^p+i,g > M(35p^q for all p, g > 0, and hence that 5p^q > for all p,q > and there- 
fore Up^q < Vp+q. Now apply Lemma [TT] to complete the proof. □ 



5 Independence of parameter choice 

Let X = [Xn : n > 1) and Y = (1^ : n > 1) be two independent Bernoulli sequences 
with parameters px and py respectively, and let W G {0, 1}^ be an arbitrary infinite 
word. 

Theorem 13. (a) The validity of the assertion "VM > 2 : P{X is M-seen in Y) = 0" 
does not depend on the values of px,Py ^ (0, 1). 

(b) The validity of the assertion " VM > 2 : P{W is M-seen inY) =0 " does not 
depend on the value of py € (0, 1). 

The main idea of the proof of Theorem [13] is to use a coupling of two Bernoulli 
sequences so that one can be M-seen in the other for sufficiently large M. 
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Lemma 14. Let X,X' be Bernoulli sequences with parameters p,p' G (0, 1) respectively. 

(a) If p' G [p"^, 1 — (1 — pY], there is a coupling such that X' can be 3-seen in X . 

(b) There is an M > 2 and a coupling such that X' can be M-seen in X . 

Proof, (a) Let X be a Bernoulli sequence with parameter p. For a given sequence 
X G {0, 1}^, we define a random subsequence F{x) by partitioning x into disjoint blocks 
each comprising 2 consecutive letters, and replacing every block 00 by 0, 11 by 1, and 
01 and 10 by 1 with probability pi and by with probability po = 1 — pi. This is done 
independently for each block and independently of the choice of the Bernoulli sequence 
X. The sequence X' := F{X) is a Bernoulli sequence with parameter 

p' = + 2p{\ — p)pi 

such that X' can be 3-seen in X (since for every A;, = X2k or X2k^i)- Since pi can 
be chosen arbitrarily in [0, 1], p' has a possible range of [p^, 1 — (1 — p)^]. 
(b) Let = p^ and f2{p) = 1 — (1 — p)^. We note that fi{p) < p < f2{p) and 

flip) ~^ and — i> 1 for k ^ oo, where /'^(p) = f ° ■ ■ ■ ° f{p)- Thus for arbitrary 
given p,p' G (0, 1), there exist po, . . . ,Pk G (0, 1) such that po = p, Pk = p' and pi+i G 
f2iPi)] for all i. Thus by (a) there exist Bernoulh sequences X^'^\ . . . jX^'^^ such 
that X*^*^ has parameter pi and X*^*"*"^) can be 3-seen in X(') for all i. Thus X' := X^'^) 
can be 3*^-seen in X := X^'^^ □ 

Proof of Theorem [73l (a) Let PxiPyiP'xiP'y ^ (0) l)- -^Y Lemma [H] there are Bernoulli 
sequences X, y, X', F' with these parameters such that X' can be Mx-seen in X, F can 
be My-seen in Y' , and (X, X') is independent of {Y^Y'). In particular we have 

{X is M-seen in Y} C {X' is My M-seen in Y'} for all M. 

Thus the validity of "VM' > 2 : P{X' is M'-seen in Y') = 0" implies the validity of 
"VM > 2 : P(X is M-seen in Y) = 0". 

(b) Let Py,Py ^ (0) By Lemma there are Bernoulli sequences Y,Y' with these 
parameters such that Y can be My-seen in Y'. In particular we have 

{W is M-seen in Y} C {W^ is My M-seen in Y'} for all M. 

Thus the validity of "VM' > 2 : P(iy is M'-seen in Y') = 0" implies the validity of 
"VM > 2 : P(iy is M-seen in Y) = 0" . □ 

6 Variance of the number of embeddings 

We prove Theorem HI Let W = {wi, W2, ■ ■ ■ , Wn) be a word of length n, and let j = 
(jO) Ji) • • • ) in) and k = {ko, ki, . . . , kn) be strictly increasing sequences of integers with 
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jo = ko = and gaps not exceeding M. Then 

E{N^) = P{j and k are M-admissible embeddings) 

j,k j,k 

where 

jUk = {ji, ja . . . , jn} U {/ei, /i;2, • • • , k^} viewed as a set, and \j D k\ = |{(r, s) 7^ (0, 0) : 
jr = ks}\. Therefore, 

as claimed in part (a). 

Part (b) follows from the fact that, for the random word X, 

E(/,,fc(X)) = (i)b-nfch^O», 

where Z{j, k) = \{l ^ : ji = ki}\. 

To determine the asymptotics of E{2^"), we proceed as follows. Let t,ti,T2,... be 
iid with the distribution of the hitting time of for the random walk J — K starting at 
0. Then P(Z„ > A;) = P(ri + ■ ■ • + < n), so 

00 

Ei2^-) = 1 + 5^ 2'=-ip(ri + • ■ ■ + Tfc < n), 

k=l 

and therefore, for small positive x, 

°° 1 - Fr'^ 

(l-x)5^x"E(2-") = ^-^. (18) 

?i=0 

The function x — > Ex'^ is smooth on [0, 1), so if we define c = CAf > 1 by Ec^^ = |, 
the right side of flTSl) is asymptotic to a constant multiple of (1 — cx)~^ as a; t c^^. By 
a Tauberian theorem (e.g.. Theorem 5 in Section XIII.5 of it follows that E{2^") 
is asymptotic to a constant multiple of c". When M = 2, i^x"^ = 1 — \^T^^^ (see, for 
example. Section XIV. 4 of |3j), so C2 = |. 

In order to get full benefit from the Tauberian theorem, we need to know that the 
sequence E{2^")c~^ is monotone; otherwise we would only have convergence in the 
Cesaro sense. We check this next. Let Un = P{Jn — Kn = 0), vq = 1, 

n 

Vn = ^UkVn-k, n>l, (19) 
k=l 
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and Vn = '^^=oVk. We will check that 

Un i, (20) 

> K+iK-1, n > 1 (21) 

and 

E{2^-) = Vn. (22) 

Once these are checked, it will follow from fl2T]) that Vn+i/Vn is decreasing, and by fl22|) 
and the Cesaro convergence noted above, Vn+i/Vn I c. Therefore Vn+i > cVn, so Ki/c" 
is increasing as required. 

To check ( l20l) . note first that by the Schwarz inequality. 



P( J„ -Kn = m) = J2 P{Kn = l)P{Jn = I + 



= 5^p^(ir„ = 



Ur,. 



Then use this to write 

Wi = XI ^'^'^'^ - ^n = m)P{Ji ~Ki = -m) < Ur, 



We check (1?T]) by induction on n. Note that it is equivalent to 



Vn . Vn+l 



Vn-l Vn 

Dividing ( |T9l) by expresses the left side of ( l23i) as an average of mi, . . . and 
of course the right side is an average of By ( l20l) . it suffices to check 

that the two averaging measures are stochastically ordered. But this is equivalent to 
Vn-iVj > VnVj-i for I < j < n, which is a consequence of the induction hypothesis. 
Finally, we check (122|) . Write 

E{2^-) = Ef[(l + liJi = KA = Yl PiJi = K,,\fzeA). 

i=l ^ ^ AC{l,...,n} 

So, it is enough to show that 



■Jn= Yl P{Ji = Ki,VteA), 

n€AC{l,...,n} 



or equivalently, that the right side above satisfies the recursion ( fT9l) . But this is easily 
checked by breaking up the sum according to the value of the smallest element k of A. 
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